Statement of interest: inconsistency-tolerance in data integration systems

نویسنده

  • Riccardo Rosati
چکیده

The task of a data integration system is to combine the data residing at different, autonomous sources, and providing the user with a unified view of these data, called global schema. Users query the global schema, while the system carries out the task of suitably accessing different sources and assembling the data retrieved at each source into the final answer to the query. Since sources are in general autonomous subsystems, the information provided by the data at the sources are likely not to be consistent with the knowledge (constraints) expressed by the global schema. Current data integration technology is actually unable to handle sources that are inconsistent with the global schema: in fact, data integration systems mainly deal with this problem through a (static) data cleaning approach, i.e., data that has to be integrated is modified in order to recover consistency with respect to the global schema. However, in many situations it would be much more desirable to derive significant information from the database even in the presence of data inconsistent with the global schema. Indeed, in many application scenarios, the explicit repair of data is not convenient, or even impossible: e.g., in virtual data integration, sources are not controlled by the integration system, which in general is not allowed to modify source data; moreover, data are not materialized (copied) in the data integration system, thus they cannot be modified. On the other hand, following the ideas proposed by the research in consistent query answering in databases, it might be possible to develop query answering (and more generally data management) techniques that realize a dynamic, virtual repair of data. According to such an approach, data are not cleaned, and inconsistency is handled at query evaluation time, through suitable query answering methods which are able to extract significant information from a data integration system even in the presence of inconsistent data. Recent research in inconsistency-tolerance in databases and information systems has produced interesting results, and promising techniques (some of which are based on standard relational database technology) have been proposed. So, it would be interesting to discuss whether (and how) this kind of technology may now have a potential impact on commercial data integration systems and applications. In particular:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inconsistency Tolerance in P2P Data Integration: An Epistemic Logic Approach

We study peer-to-peer data integration, where each peer models an autonomous system that exports data in terms of its own schema, and data interoperation is achieved by means of mappings among the peer schemas, rather than through a global schema. We propose a multi-modal epistemic semantics based on the idea that each peer is conceived as a rational agent that exchanges knowledge/belief with o...

متن کامل

Evaluation of Failure Causes in Employing Hospital Information Systems

Today, the information systems play a critical role in business for each organization. Like other organizations, hospitals use information systems for data collection, data storage, data processing and the like to have long-term and short-term achievements. Despite the very benefits of implementing HIS and its costly implementation, the HIS project sometimes fails. The importance of the HIS fai...

متن کامل

Academic Statement by Leopoldo Bertossi

(A) Data Management and Business Intelligence. Specific areas of interest and research have been: (a) Inconsistency management in databases. (b) Virtual data integration. (c) Multidimensional databases, in particular semantics problems and their impact on OLAP and data analytics. (d) Peer data exchange. (e) Contexts for data management. (f) Data quality assessment and data cleaning, in particul...

متن کامل

Introduction to Inconsistency Tolerance

Inconsistency arises in many areas in advanced computing. Examples include: Merging information from heterogeneous sources; Negotiation in multi-agent systems; Understanding natural language dialogues; and Commonsense reasoning in robotics. Often inconsistency is unwanted, for example, in the specification for a plan, or in sensor fusion in robotics. But sometimes inconsistency is useful, e.g. ...

متن کامل

Full routing and synoptic analysis A sample of studies of heavy rainfall systems in excess of 50 mm in southern Iran

Problem statement The occurrence of terrible floods due to climate change has caused much damages in different parts of the world in recent decades, and the effect of these changes is more pronounced in dry areas. Floods are the most common environmental damage. On average, 60 floods occur annually in Iran, with an average annual flood loss of 141 people, meaning more than 2 deaths per year pe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007